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In this paper we study the geometry of graph spaces endowed with a 
special class of graph edit distances. The focus is on geometrical results 
useful for statistical pattern recognition. The main result is the Graph 
Representation Theorem. It states that a graph is a point in some geo¬ 
metrical space, called orbit space. Orbit spaces are well investigated and 
easier to explore than the original graph space. We derive a number of 
geometrical results from the orbit space representation, translate them to 
the graph space, and indicate their significance and usefulness in statistical 
pattern recognition. 


1 Introduction 

The graph edit distance is a common and widely applied distance function for pattern 
recognition tasks in diverse application areas such as computer vision, chemo- and 
bioinformatics [16] . One persistent problem of graph spaces endowed with the graph 
edit distance is the gap between structural and statistical methods in pattern recogni¬ 
tion Gum. This gap refers to a shortcoming of powerful mathematical methods that 
combine the advantages of structural representations under edit transformations with 
the advantages of statistical methods defined on Euclidean spaces. 

One reason for this gap is an insufficient understanding of the geometry of graph 
spaces endowed with the graph edit distance. Few exceptions towards a better under¬ 
standing of graph spaces are, for example, theoretical results presented in umm- 
However, a sound theory towards statistical graph analysis in the spirit of [25, 30! 
for complex objects, [ 121130 ! for tree-structured data, and ilO; [22] for shapes is still 
missing. 

Here, we study the geometry of graph spaces with the goal to establish a mathemat¬ 
ical foundation for statistical analysis on graphs. The basic ideas of this contribution 
build upon M and are inspired by tuna mi. The graphs we study comprise directed 
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as well as undirected graphs. Nodes and edges may have attributes from arbitrary sets 
such as, for example, real values, feature vectors, discrete symbols, strings, and mix¬ 
tures thereof. We assume that graphs have bounded number of nodes. 

The key result is the Graph Representation Theorem 3.1 formulated for graph edit 
kernel spaces. A graph edit kernel space is a graph space endowed with a geometric 
version of the graph edit distance. Theorem |3.1| is useful, because it provides deep 
insight into the geometry of graph spaces and simplifies derivation of many interesting 
results relevant for narrowing the gap between structural and statistical pattern recog¬ 
nition, which would otherwise be a complicated endeavour when done in the original 
graph space. 

The Graph Representation Theorem |3.1| states that a graph is a point in a geo¬ 
metrical space, called orbit space. The geometry and topology of orbit spaces is well 
investigated and much easier to explore than those of graph spaces. Based on the 
Graph Representation Theorem |3.1| we show that the graph space is a geodesic space, 
prove a weak form of the Cauchy-Schwarz inequality and derive basic geometrical con¬ 
cepts such as the length, angle, and orthogonality. Then we present geometrical results 
from the point of view of a generic graph. One result is the weak version of Theo¬ 
rem [XT] It states that the graph space looks like a convex polyhedral cone from the 
perspective of a generic graph. This result is useful, because it supports geometric in¬ 
tuition for deriving further results. Finally, we indicate the significance and usefulness 
of the derived geometrical results for statistical pattern recognition on graphs. 

The paper is structured as follows: Section 2 introduces attributed graphs, the graph 
edit distance and graph edit kernels. In Section 3, we present the Graph Representa¬ 
tion Theorem and derive general geometric results. Then in Section 4, we study the 
geometry of graph edit kernel spaces from the point of view of generic graphs. Finally, 
Section 5 concludes with a summary of the main result and indicates how the derived 
geometric results can be used in statistical pattern recognition. 


2 Graph Edit Kernel Spaces 

This section first introduces attributed graphs and the graph edit distance. To obtain 
graph spaces with a richer mathematical structure, we introduce graph metrics that 
are locally induced by inner products. We show that the derived graph metrics is 
(1) a special subclass of the graph edit distance, and (2) a common and widely used 
graph dissimilarity measure. For this purpose, we use a different formalization than 
presented in the literature. 

2.1 Attributed Graphs 

Let A be the set of node and edge attributes. We assume that A contains two (not 
necessarily distinct) symbols IVy and Ns denoting the null element for nodes and edges, 
respectively. 

Definition 2.1 An attributed graph is a triple X = (V,£\a), where V represents a 
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finite set of nodes, £ C V x V a set of edges, and a : V x V —► A is an attribute function 
satisfying 

1. a(i,j) G A\{N v ,N £ }, if i^j and ( i,j ) G £ 

2. a(i,j) =N £ ifi^j and (■ i,j ) ^ £ 
for all i,j G V. 

The definition of an attributed graph implicitly assumes that graphs are fully connected 
by regarding non-edges as edges with null attribute N £ . Observe that nodes may have 
any attribute from A. The node set of a graph X is referred to as Vx, its edge set as 
£x, and its attribute function as ax- By Qa we denote the set of attributed graphs 
with attributes from A. 

Graphs can be directed and undirected. Attributes for node and edges may come 
from the same as well as from different or disjoint sets. For the sake of simplicity, 
we merged node and edge attributes into a single attribute set. Attributes can take 
any value. Examples are binary attributes, discrete attributes (symbols), continu¬ 
ous attributed (weights), vector-valued attributes, string attributes, and combinations 
thereof. Thus, the definition of attributed graphs is sufficient general to cover a wide 
class of graphs such as binary graphs from graph theory, weighted graphs, molecular 
graphs, protein structures, and many more. 

Definition 2.2 A graph Z is a subgraph of X, if 

1. V z C V x 

2- £ z C £ x 

3. az = axlvzxVz- 

Suppose that X and Y are two graphs with m and n nodes, respectively. We say X 
and Y are size-aligned if both graphs are expanded to size n + m by adding isolated 
nodes with attribute Ny. By X and Y we denote the size-aligned graphs of X and Y. 

Definition 2.3 A morphism <f> : X —> Y between graphs X and Y is a bijective map 

f-Vy^Vy 

between the node sets Vjj and Vy of the size-aligned graphs X and Y, respectively. 

Note that we use that same notation for morphism and its bijective node map. By 
A ix,Y we denote the set of all morpliisms from X to Y. 

Definition 2.4 An isomorphism is a morphism <p : X —» Y between graphs X and Y 
such that 


for all i,j G Vy 


= ay((j)(i),4>(j)) 
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Two graphs X and Y are isomorphic if and only if there is an isomorphism (j): X —> Y 
such that the restriction of </> to the unaligned node set Vx satisfies 

otx (i,j) = ot Y (<P(i), 

for all i,j £ Vx- Thus the definition of isomorphism corresponds to the common 
definition of isomorphism from graph theory. 

2.2 Graph Edit Distance 

Next, we endow the set Q .4 with a graph edit distance function. The basic idea of the 
graph edit distance is to regard a morphism <f> : X —> Y as a transformation of a graph 
X to a graph Y by successively applying edit operations. Possible edit operations 
are insertion, deletion, and substitution of nodes and edges. Each node and edge edit 
operation is associated with a cost given by an edit cost function e : A x A — > R. Then 
the cost of transforming X to Y along morphism cj> is the sum of the underlying edit 
costs. Table [l] provides an overview of different edit operations and the form of their 
edit costs. 

Let £ : A x A —> R be an edit cost function. The cost of transforming X to Y along 
morphism <f >: X —» Y is given by 

5 (/> (X,Y)= ^ £ ( a x( i J)^ ot((/>(*),0(j))). 
iJ&Vx 

The graph edit distance of X and Y minimizes the transformation cost over all possible 
morphisms between X and Y. 

Definition 2.5 Let e : A x A —> R be an edit cost function. The graph edit distance 
is a function 5 : Qa x Q _4 —> R with 

S(X,Y) = min 8AX,Y). 

<t>£Mx,Y 

A graph edit distance space is a pair {Ga,$) consisting of a set of attributed graphs 
together with a graph edit distance. 

2.3 Graph Edit Kernels 

Graph spaces endowed with the graph edit distance are difficult to analyze. To obtain 
spaces that are mathematically more structured, we impose constraints on the set of 
feasible morphisms and the choice of edit costs. 

First, we constrain the set of morphisms to the subset of compact morphisms. 

Definition 2.6 A morphism <f>: X —)■ Y between graphs X and Y is compact, if 

#x)CVy or <t>-\V Y )CV x - 

By C x . y we denote the subset of compact morphisms between X and Y. 
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cost 

meaning 

s(N v , yj j) 

insertion of node yjj 

e(xu,N v ) 

deletion of node Xu 

£ {%ii i Vjj ) 

substitution of node Xu by node yjj 

e(N v ,N v ) 

dummy operation with cost zero 

£ (-^£ ? Vrs ) 

insertion of edge y rs 

£ ip^ij •> ^E ) 

deletion of edge x,j 

£ (p^ij 5 Vrs) 

substitution of edge Xij by edge y rs 

£ (Ns, N £ ) 

dummy operation with cost zero 


Table 1: Overview of cost for edit operations. By a'u,yjj we denote node attributes 
and by Xij,y rs edge attributes. Costs c (Ny, Ny) and c (Ns, Ns) are zero and 
arise by expansion of graphs for mathematical convenience. 


A compact morphism demands that each node of the smaller of both graphs corre¬ 
sponds to a unique node of the larger one. 

Next, we constrain the choice of edit cost via edit scores for measuring the similarity 
of node and edge attributes. We consider edit score functions of the form 

Tdxd->K, (x, y) i-x $(x) T $(y), 

where $ : A —X TL is a feature map into a Hilbert space TL. Then the edit score is 
a positive definite kernel on A. The score of transforming X to Y along a compact 
morphism <f> : X —> Y is given by 

k^(X,Y)= Y k (a^(i,j), ay (<£(*)> 00 '))) ■ 

Maximizing the transformation score over all compact morphisms gives the graph edit 
kernel. 

Definition 2.7 The graph edit kernel is a function k : Ga x Ga —t M with 

k(X,Y) = max k^X, Y). 

r/>&Cx,Y 

As an optimal assignment kernel, the graph edit kernel is not positive definite :2I5], 
but gives rise to a metric. 

Proposition 2.8 A graph edit kernel n induces a metric S : Ga x Ga —•* defined by 

S(X, Y) = y/ K (X, X) + k(Y, Y) - 2k(X, Y) (1) 


for all X , Y e Ga- 
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Proof: Follows from Theorem 13.31 ■ 

We call the graph edit distance 5 defined in Prop. |2.8| the metric induced by the 
graph edit kernel k. A graph edit kernel space is a graph edit distance space (Ga,8), 
where S is a metric induced by a graph edit kernel. 

An equivalent way to derive the metric defined in ([I]) is as follows: suppose that 
k(x, y ) = $(x) T< f>(y) is a positive definite kernel. Define the edit cost function 

£i:dxd-)l, (x,y) ||$(x) - $(?/)|| 2 , 

where the norm ||-|| in 7~L is induced by the inner product in the usual way. Applying 
the kernel-trick gives 

Sk{x, y) = k(x, x) + k(y, y) - 2 k(x, y). (2) 

Then the edit cost function £& induces a graph edit distance that coincides with the 
squared metric defined in ([l]). 

2.4 Examples 

The following examples show that the graph edit kernel and its induced graph edit 
kernel distance are not artificial constructions, but comprise well known and widely 
applied structural (dis)similarity measures for graphs. 

2.4.1 Maximum Common Subgraph 

The first example shows that the maximum common subgraph problem is equivalent 
to the problem of computing a graph edit kernel. For this, we first introduce two 
definitions. 

Definition 2.9 A common subgraph of X and Y is a graph Z that is isomorphic to 
subgraphs Zx of X and Zy ofY. 

Let Sx,y denote the set of all common subgraphs of graphs X and Y. Then a maximum 
common subgraphs of two graphs is a subgraph Z* with maximum number of nodes 
and edges. 

Definition 2.10 A maximum common subgraph of X and Y is a common subgraph 
satisfying 

Z* = arg max \V z \ + \£z\- 
Z(=:Sx,Y 

The function 

k:AxA^R, (x, y) ^ { J ; 

is a positive definite kernel that gives rise to a graph edit kernel 

k{X,Y)= max \V Z \ + \£z\ ■ 
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2.4.2 Geometric Graph Metrics 

Let A = R be the set of weights with null elements TVy = Ng = 0. Then Ga is the 
set of (positively) weighted graphs. We can represent a graph X with n nodes by a 
weighted adjacency matrix X = (%ij) from R nx ” with elements a= ax -(i,j). 

The function 


k : A x A —¥ R, (x, y) K ¥ x ■ y (3) 

is a positive definite kernel as an inner product of the one-dimensional vector space A. 
The induced edit cost function takes the form 

e(x,y) = (x- yf . 

Let X e R" x ™ and Y £ be weighted adjacency matrices of graphs X and Y. 

respectively. Then the graph metric induced by kernel ff is of the form 

S(X, Y) = min ||x-pyp t ||, 

Pen" 11 

where II denotes the set of all (n x to) subpermutation matrices of rank d = min(n, to). 
A subpermutation matrix of rank d is a matrix that satisfies the following conditions: 

1. All matrix elements are from {0,1}. 

2. Each row and each column has at most one element with value 1. 

3. The matrix has exactly d elements with value 1. 

Geometric graph metrics can be easily generalized to the case where node and edge 
attributes are from different Euclidean spaces, say and R 9 . 

3 The Graph Representation Theorem 

The Graph Representation Theorem states that a graph of bounded order can be 
represented as a point in an orbit space. This result is useful, because it provides deep 
insight into the geometry of graph edit kernel spaces and simplifies derivation of many 
results. 

Assumptions. We assume that the underlying edit score k : A x A —► R is defined as 
an inner product of the feature map $ : A —> T~L into the d-dimensional Euclidean space 
T~L = R d . By Qu we denote the space of attributed graphs of all graphs of bounded 
order n with attributes from the feature space We may regard Ga as a subset of 
Gn v i a the feature map 4>. By n : Qu x Gu ~> R we denote the graph edit kernel based 
on edit score k , and by <5 : Gu x Gh —•* ® the metric induced by the graph edit kernel 

K. 

1 The order of a graph is the number of its nodes. 
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3.1 The Graph Representation Theorem 

Let Q be a group with neutral element e. An action of group Q on a set A is a map 

<p : Q x A —> A, ( 7 , a;) 1 —> 7 X 


satisfying 

1 . (7 o Y)x = 7 ( 7 'a;) 

2 . EX = X 

for all 7,7 1 £ G and all x G A. The orbit of x G A under the action £ is the subset of 
A defined by 

[x] = {7X : 7 G £} . 

We write 2/ G [x] to denote that x' is an element of the orbit [x]. The orbit space of 
the action of Q on A is defined to bet the set of all orbits 

X/g = {[x] : x G A}. 

The next result states that each graph can be represented as a points of an orbit 
space. 

Theorem 3.1 (Graph Representation Theorem) A graph edit kernel space (Q-u, 5) 
is isometric to the orbit space A /Q of the action of a group Q of isometries on a Eu¬ 
clidean space A. 

Proof: We present a constructive proof. 

1. First we show that we may assume that all graphs from C/% are of order n without 
loss of generality. Suppose that X = (V, £, a) is a graph. An isolated node i G V is a 
node without connection to any other node, that is 

a(i,j) = a(j,i ) = N e 

for all j G V\ {?'}, where Ng is the null attribute for denoting non-existence of an edge. 
An isolated node i G V is a null-node if a(i,i) = ./Vy, where Ay is the null attribute for 
nodes. Suppose that X' is a graph obtained by removing or adding null-nodes. Then 
by definition, the graphs X and X' are isomorphic. Thus, if X is of order m < n, we 
replace A by a graph X' of order n by augmenting X with n — m null-nodes. 

2. Let A = J{ nxn be the set of all (n x n)-matrices with elements from %. An 
attributed graph X = (V, £, a) is completely specified by a matrix X = ( Xij ) from A 
with elements x,j = a(i,j) for all i,j G V. 

3. The form of matrix X is generally not unique and depends on how the nodes 
are arranged in the diagonal of X. Different orderings of the nodes may result in 
different matrix representations. The set of all possible re-orderings of all nodes of X 
is (isomorphic to) the symmetric group S n , which in turn is isomorphic to the group Q 



of all simultaneous row and column permutations of a matrix from X. Thus, we have 
a group action 

gxX^X, ( 7 ,X)^ 7 X, 

where 7 X denotes the matrix obtained by simultaneously permuting the rows and 
columns according to 7 . For X £ X, the orbit of X is the set defined by 

[X] = { 7 X : 7 € • 


By 


x/g = {[x] : le xj 

we denote the orbit space consisting of all all orbits. The natural projection map is 
defined by 

7 T-.x-> x/g, X [X]. 


4. To emphasize that X is a Euclidean space, we use vector instead of matrix notation 
henceforth. Consequently, we write x instead of X. By [a] we denote the orbit of x. 

5. The quotient topology on X/g is the finest topology for which the projection map 
7 r is continuous. Thus, the open sets U of the quotient topology on X /(? are those sets 
for which 7 r _ 1 (W) is open in X. 

6 . We can endow X/g with the quotient distance defined by 

k 

d(N,[y]) = inf ^2\\xi-yi\\, 

i=1 

where the infimum is taken over all finite sequences (ai, ....x *,) and (yi,... .yi/) with 
[x\\ = [a], [y n \ = [y], and [x/\ = [yi+\] for all i £ {1,..., k — 1}. The distance 
d ([a], [y]) is a pseudo-metric [4j, Lemma 5.20. Since g is finite and acts by isometries, 
the pseudo-metric is a metric of the form 

d{[x], [y]) = min{||a: - y|| : x £ [x\,y £ [y]} 

= min{||a — y|| : x £ [a]} 

= min{||a - y|| : y £ [y]}, 

where y in the second row and x in the third row are arbitrarily chosen representations. 

7. The quotient metric d([a], [y]) on X/g defined in part [6] induces a topology that 
coincides with the quotient topology defined in part [5j 

8 . Consider the map 

x '■ Gw — t X/g , X y—> [x] 

that assigns each graph X to the orbit [a] consisting of all representations of X. The 
map to is surjective, because any matrix x £ X represents a valid graph X. The 
problem of dangling edges is solved by allowing nodes with null attribute. Then the 
orbit [X] consists of all matrices representing X. The map w is also injective. Suppose 
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that X and Y are non-isomorphic graphs with respective representations x and y. If 
x and y are in the same orbit, then there is an isomorphism between X and Y, which 
contradicts our assumption. Thus, w is a bijection. 

9. We have 

S 2 (X,Y)= min ^ \\a x {i,j) - (<£(*), </>(j))|| 2 

(f>£Cx,Y ““ 
i,j€Vx 

= ,H^ii — 2/0(000')II 

<pec x ,Y . f—• 
ij&Vx 

= min||®- 7 (y)|| 2 

7Gy 

= d 2 ([*],[v]) 

for all X, y £ Gu ■ The first equation follows from the definition of a graph edit 
distance induced by a graph edit kernel. The second equation changes the notation 
of the attributes. The third equation follows from the equivalence of morphism and 
group action by construction. Finally, the fourth equation follows from part [5] 

10. The implication of part [9] are twofold: 

1. The distance 5{X,X) induced by the graph edit kernel n is a metric. 

2 . The map ui : Gu —> X/Q defined in part [8] is an isometry. 

This shows the assertion of the theorem. 


Corollary |3.2| summarizes results obtained in the course of proving the Graph Rep¬ 
resentation Theorem. 

Corollary 3.2 

1. The natural projection tt : X —> X/Q is continuous. 

2. The quotient distance d on X/Q is a metric. 

3. The map 

to '■ Gu —t X/Q, X i —'> [a;] 

is a bijective isometry between the metric spaces (Gui&) and (X/G,d). 

4- The distance <5 on Gu induced by the graph edit kernel k is a metric satisfying 

5(X, y) = min{||x — y\\ : x £ to(X),y £ uj(Y)} 

= min{||x — y\\ : x £ to{X)} for all y £ to(Y) 

= min{||x — y\\ : y £ ui(Y)} for all x £ to(X) 

for all X , y £ Gu- 
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5. The graph edit kernel n satisfies 


k(X, Y ) = max {x T y 
= max | x T y 
= max {x T y 


x G ui(X),y G w(Y)} 
x G w(X)} for all y G oj(Y) 
y G w(Y)} for all x G ui(X) 


for all X , Y G On- 


Studying graph edit kernel spaces reduces to the study of orbit spaces X/Q due to 
the Graph Representation Theorem. Though analysis of orbit spaces is more general, 
we translate all results into graph edit kernel spaces to make them directly accessible 
for statistical pattern recognition methods. For this, we identify the graph space Gu 
with the orbit space X/G via the bijective isometry ui : Gu X/Q. We denote this 
relationship by Gu = X/G- Suppose that X G Gu an d x G X. We briefly write x G X 
if ui( A') = [a:]. In this case, we call x a representation of graph X. 

Next, we summarize some results useful for a statistically consistent analysis of 
graphs. 

Theorem 3.3 A graph edit kernel space [Gu^) has the following properties: 

1. (Gn,fi) is a complete metric space. 

2. (Gn,8) is a geodesic space. 

3- (Gn,fi) is locally compact. 

f. Every closed bounded subset of (Gn,3) is compact. 

Proof: Since Gu = X/G it is sufficient to show the assertions for the orbit space X/G- 

1 . Since the group G is finite, all orbits are finite and therefore closed subsets of X. 
The Euclidean space X is a finitely compact metric space. Then X/G is a complete 
metric space |26], Theorem 8.5.2. 

2 . Since X is a finitely compact metric space and G is a discontinuous group of 
isometries, the assertion follows from [26J, Theorem 13.1.5. 

3. Since G is finite and therefore compact group, the assertion follows from [3], 
Theorem 3.1. 

4. Since Gu is a complete, locally compact length space, the assertion follows from 
the Hopf-Rinow Theorem (see e.g.gj. Prop. 3.7). 
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3.2 Length and Angle 

In this section, we introduce basic geometric concepts such as length and angle of 
graphs, which are important for a geometric interpretation and understanding of gen¬ 
eralized linear methods for classification of graphs [201 E3] ■ 

Definition 3.4 The scalar multiplication on Gn is a function 

1 :tx Gu —x C/%, (A,X) > XX , 

where XX is the graph obtained by scalar multiplication of X with all node and edge 
attributes of X. 

In contrast to scalar multiplication on vectors, scalar multiplication on graphs is 
only positively homogeneous. 

Proposition 3.5 Let X £ R+ a non-negative scalar. Then we have 

k(X,X Y) = X k(X,Y) 

for all X,Y £ Gn- 

Proof: From Gu — X/G and Corollary |3.2| follows 

k(X, Y ) = max {x T y : x £ X}, 

where y £ Y is arbitrary but fixed. Suppose that k(X,Y) = xffy for some represen¬ 
tation Xq £ X. Then we have xffy > x T y for all x £ X. This implies 

x o (Ay) = A (xly) > X (x T y) = x T (Xy) 

for all non-negative scalars A £ R + . The assertion follows from A Y = [Ay] if Y = [y]. 

■ 

Using graph edit kernels, we can define the length of a graph in the usual way. 
Definition 3.6 The length £(X) of graph X £ Gn is defined by 

£(X) = y/n(X,X). 

The length of a graph can be determined efficiently, because the transformation score 
of the identity morphism is maximum over all morpliisms from a graph to itself. 

Proposition 3.7 The squared length of X is of the form 

t(x) = 11*11 = V*i<i(x,x) = I J2 Hi,j) 

y hieVx 

for all x £ X. 
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Proof: From Corollary |3.2| follows 

k(X, X) = max {x T x' : x,x' £ X} . 


We have 


x T x' = ||a;|| ||a;'|| coscc, 


where a is the angle between x and x' . Since Q is a group of isometries acting on X, 
we have 


ll*|| = ||*'|| 


for all elements x and x' from the same orbit. Thus, x T x' is maximum if the angle 
a £ [0, 27r] is minimum over all pairs of elements from A'. The minimum angle is zero 
for pairs of identical elements. This shows the assertion. ■ 


The relationship between the length of a graph and the graph edit kernel is given 
by a weak form of the Cauchy Schwarz inequality: 


Theorem 3.8 (Weak Cauchy-Schwarz) Let X,Y £ Qu be two graphs. Then we 
have 

\k(X,Y)\<£(X)-£(Y), 

where equality holds when X and Y are positively dependent. 

Proof: From Qu = X/Q and Corollary |3.2| follows 

k(X , Y ) = max {x T y : x £ X, y £ Y } 


for all X,Y £ Qu- Suppose that x 0 £ X and y 0 £ Y are representations such 
that k(X,Y) = x^yo. From the standard Cauchy-Schwarz inequality together with 
Prop. |3.7| follows 

\k{X,Y)\ = \x%y 0 \ < ||ac 0 || ||y 0 || = £(X)£(Y). 

We show the second assertion, that is equality if X and Y are positively dependent. 
Suppose that X = AT for some A £ R+. From the definition of the length of a graph 
together with Prop. 1 3.5 1 follows 

X 2 £ 2 (X) = A 2 k(X,X) = k(XX, XX) = i 2 {XX). 

Then by using Prop. [375] we obtain 

|k(A, A A) | = Xk(X, X) = X£(X)£{X) = £(X)£(XX). 


The Cauchy-Schwarz inequality is considered to be weak, because equality holds 
only for positively dependent graphs. This is in contrast to the original Cauchy- 
Schwarz inequality in vector spaces, where equality holds, when two vectors are linearly 
dependent. Nevertheless, we can use the weak Cauchy-Schwarz inequality for defining 
an angle between two graphs. 
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Definition 3.9 The cosine of the angle between non-zero graphs X and Y is defined 
by 


cos a = 


<X,Y) 

wen' 


( 4 ) 


With the notion of angle, we can introduce orthogonality between graphs. 

Definition 3.10 Two graphs X and Y are orthogonal, if k(X,Y) = 0. A graph X is 
orthogonal to a subset U C Q w , if 

k(X, Y) - k(X, Z) = 0 


for allY,Z £U. 

3.3 Geometry from a Generic Viewpoint 

In this section, we describe the geometry of a graph edit kernel space from a generic 
viewpoint. A generic property is defined as follows: 

Definition 3.11 A generic property is a property that holds on a dense open set. 

Suppose that the metric space (M, d) is either a Euclidean space or graph edit kernel 
space. The underlying topology that determines the open subsets of M is the topology 
induced by the metric d. The open sets of the topology are all subsets that can be 
realized as the unions of open balls 

B(z,p) = {x € M. : d(z,x) < p}, 

with center z € Xi and radius p > 0. In measure-theoretic terms, a generic property 
is a property that holds almost everywhere , meaning for all points of a set with Borel 
probability measure one. 

3.3.1 Dirichlet Fundamental Domains 

We assume that Q is non-trivial. In the trivial case, we have X = X/Q and therefore 
everything reduces to the geometry of Euclidean spaces. For every x £ X, we define 
the isotropy group of x as the set 

Qx = {7 e Q : 7 x = x} . 

An ordinary point x £ X is a point with trivial isotropy group Q x = {e}. A singular 
point is a point with non-trivial isotropy group. If x is ordinary, then all elements of 
the orbit \x\ are ordinary points. 

A subset T of X is a fundamental set for Q if and only if T contains exactly one 
point x from each orbit [x] G X/Q. A fundamental domain of Q in X is a closed set 
VCX that satisfies 

1 - X = \J yf .glD 
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2. 7 'D 0 n T>° = 0 for all 7 £ Q \ {e}. 


Proposition 3.12 Let z £ X be ordinary. Then the set 

V z = {x £ X : \\x — z\\ < ||® — 72 :11 for all 7 £ G} 
is a fundamental domain, called Dirichlet fundamental domain centered at z. 

Proof: [2B], Theorem 6.6.13. ■ 

Proposition 3.13 LetT> z he a Dirichlet fundamental domain centered at an ordinary 
point z. Then the following properties hold: 

1. T> z is a convex polyhedral cone. 

2. There is a fundamental set T z such that 

V° Z £F Z £ V z . 

3. We have z £ V°. 

4- Every point x £ T>° z is ordinary. 

5. Suppose that ®, 7 ® £ V z for some 7 £ Q \ {e}. Then ®, 7 ® £ dVf. 

6. The Dirichlet fundamental domain can be equivalently expressed as 

D z = {® £ X : x T z > ® T y z for all 7 £ t/} . 


Proof: 

1. For each 7 ^ e, we define the closed halfspace 'H 1 = {x £ X : ||® — z\\ < ||® — 7 z||}. 
Then the Dirichlet fundamental domain D z is of the form 

v z =n ^7- 

76 s 

As an intersection of finitely many closed halfspaces, the set V z is a convex polyhedral 
cone [T5] . 

2. [26] . Theorem 6.6.11. 

3. The isotropy group of an ordinary point is trivial. Thus z T z > z T "/z for all 
7 £ Q \ {e}. This shows that 2 : lies in the interior of V z . 

4. Suppose that x £ V° is singular. Then the isotropy group Q x is non-trivial. Thus, 
there is a 7 £ Q \ {e} with x = 7 ®. This implies ® £ 7 V z n V z . Then ® £ dT> z 
is a boundary point of D z by [26] . Theorem 6.6.4. This contradicts our assumption 
® £ D° and shows that ® is ordinary. 

5. From ®,y® £ V z follows ||® — z|| = || 7 ® — z||. Since Q acts by isometries, we 
have ||® — z|| = || 7 ® — 7 z||. Combining both equations yields ||y® — z|| = H 7 ® — 7 z||. 
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This shows that jx £ dT> M . Let 7 ' £ Q be the inverse of 7 . Since 7 / e, we have 
7 ' 7 ^ e. Then 

11* - «ii = h* - = 11Y7* - 1' z \\ = 11* - y*n, 

where the second equation follows from isometry of the group action. Thus, ||a: — z|| = 
||a; — 7 'z|| shows that x £ dT) 

6 . The following equivalences hold for all 7 € Q: 


x £ V z 

O ||®- z || 2 < ||®- 7 z || 2 

^ ll*H 2 + \\z \\ 2 -2 xT z < ll*f + h z \\ 2 - 2 x T 'yz 

47 x T Z > X T "/Z. 

The last equivalence uses that Q acts on X by isometries. This shows the last property. 


Corollary 3.14 A generic point z £ X is ordinary. 


Proof: Suppose that z £ X is ordinary. Then there is a Dirichlet fundamental domain 
T> z . From Prop. 3.13 follows that all points of the open set T>° are ordinary. With z 
all representatives from [z] are ordinary. Then all points of 7 T>° are ordinary for every 
7 £ Q. The assertion holds, because the union (J 72 ?^ is open and dense in X. ■ 


3.3.2 The Weak Graph Representation Theorem 


Let w : Qu ~> X/Q be the bijective isometry defined in Corollary 3.2 A graph Z £ Qu 
is ordinary, if there is an ordinary representation z £ Z. In this case, all representations 
of Z are ordinary. The following result is an immediate consequence of Corollary |3.14| 

Corollary 3.15 A generic graph Z £ Qu is ordinary. 


The Weak Graph Representation Theorem describes the shape of a graph edit kernel 
space from a generic viewpoint. 

Theorem 3.16 (Weak Graph Representation Theorem) Suppose that (On, 8) is 
a graph edit kernel space. For each ordinary graph Z £ Qu there is an injective map 
p, : Qu X into a Euclidean space (X, ||-||) such that 

1. 6(Z,X) = \\p(Z) - p{X)\\ for all X £ Q n . 

2. 8{X,Y) < \\p,{X) - fj,(Y )|| for all X £ Q n . 

3. The closure T>^ = cl (p(Qu)) is a convex polyhedral cone in X. 

4 . We have X>° C pi(Qu) £ ^V 


16 






Proof: Let u : Gu —> X/G be the bijective isometry defined in Corollary 3.2 


1. Suppose that Z £ Gu is a graph with z £ w(Z) = [z]. Since Z is ordinary so is 2 . 
Let = V z be the Dirichlet fundamental domain centered at z, and T z C is a 
fundamental set. 


2. The fundamental set T z C X induces a bijection / : T z X/Q that maps each 
element x to its orbits [x]. Then the map 

d : Gu —■► A’, X 1 y f 1 (w(X)) 


is injective as a composition of injective maps. 

3. We show the first property. Let X be a graph. Then from Corollary |3.2| follows 
S(Z, A') = min {Hz' — x\\ : z' £uj(Z)}, 

where x = p(X). Since J~ z is subset of the Dirichlet fundamental domain T> z , we have 

S(Z,X) = \\z-x\\. 


This shows the assertion. 


4. We show the second property. From the second part of this proof follows 
p(X) = f~ 1 (uj(X)). This implies /z(X) £ uj(X). Thus, /i maps every X to exactly 


one representation x £ aj(X). Then the assertion follows from Corollary 3.2 

5. We have p(Gu) = X z . Then the third and fourth property follow from Prop. 3.13 


We call the map /z : Qu ~> X an alignment of Gu along Z. The polyhedral cone 
is the Dirichlet fundamental domain centered at /z( Z). Note that an alignment along 
Z is not unique. 

The first property of the Weak Graph Representation Theorem states that there is 
an isometry with respect to a generic graph Z into some Euclidean space. The second 
property states that the alignment /z is an expansion of the graph space. Properties 
(3) and (4) say that the image of an alignment along a generic graph is a dense subset 
of a convex polyhedral cone. A polyhedral cone is the intersection of finitely many 
half-spaces. Figure [l] illustrates the statements of Theorem |3.16[ 

According to Theorem |3.16| an alignment fi along a generic graph Z is an isometry 
with respect to Z , but generally an expansion of the graph space. Next, we are inter¬ 
ested in convex subsets U C Gu such that /z is isometric on U , because these subsets 
have the same geometrical properties as their convex images /z(W) in the Euclidean 
space X by isometry. To characterize such subsets, we introduce the notion of cone 
circumscribing a ball for both metric spaces, the Euclidean space (A,||-||) and the 
graph kernel edit space (G,S). 

Definition 3.17 Suppose that the metric space (XI, d) is either a Euclidean space or 
graph edit kernel space. Let z £ M and let p > 0. A cone circumscribing a ball B(z , p) 
is a subset of the form 

C(z, p) = {x £ XI : 3 A > 0 s.t. Ax £ B(z, p)} . 
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Figure 1: Illustration of the Weak Graph Representation (WGR) Theorem. Suppose 
that [i : Gu —> X is an alignment along a generic graph Z. The box on the 
left shows the graph space Gu together with graphs X,Y,Z £ Gu- The box 
on the right depicts the image y(Gu) as a region of the Euclidean space X 
together with the images x = n(X), y = y(Y), and 2 : = y,(Z). Property (1) 
of the WGR Theorem states that /i is isometric with respect to Z. Distances 
are preserved for S(Z, X) = \\z — x\\ and S(Z, Y) = \\z — y\\ as indicated by 
the respective solid red lines. Property (2) of the WGR Theorem states that 
S{X,Y) < (I* - y || as indicated by the shorter dashed red line connecting X 
and Y in Gu compared to the longer dashed red line connecting the images 
x and y. Properties (3) and (4) of the WGR Theorem state that the closure 
V M of the image y(Gu) i n X (right box) is a convex polyhedral cone. Points 
of T>n without pre-image in Gu are boundary points as indicated by the holes 
in the dotted black line. Since a polyhedral cone can be regarded as a set 
of rays emanating from the origin, the side opposite of 0 of the Dirichlet 
fundamental domain is unbounded. 
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The next results states that an alignment along an ordinary graph induces a bijective 
isometry between cones circumscribing sufficiently small balls. 

Theorem 3.18 An alignment p, : Q % —> X along an ordinary graph Z can be restricted 
to a bijective isometry from C(Z, p) onto C{z,p) for all p such that 

0 < P< P* = \ min {|| 2 - £c|| : x £ dT>^}, 


where z = p(Z). 
Proof: 


1. According to Corollary 3.2 we have Qu — X/Q. The group £ is a discontinuous 


group of isometries. Suppose that 2 £ Z. Since Z is ordinary, the isotropy group Q z 
is trivial. Then the natural projection ir : X —» X/Q induces an isometry from B(z , p) 
onto B(n(z),p) for all p such that 


0 < P < ^min{|| 2 - 72 || : 7 € 5\{e}} 
by [26,, Theorem 13.1.1. This implies an isometry from B(z,p) onto B(Z,p). 


(5) 


2 . Since Z is ordinary, we have 2 £ V/ by Prop. 3.13 The ball B(z,p ) is contained 
in the open set T>° for every radius p satisfying eq. ([sj. To see this, we assume that 
B(z, p) contains a boundary point x of V^. Then there is a 7 £ Q \ {e} such that 


I* ~ 721 | = ||a= — ^|| < p- 


We have 


l |2 721| = II2 x x 721 

< || 2 -®|| + || 72 -aj|| 

< 2 p. 

Since p satisfies eq. ([5|, we obtain a chain of inequalities of the form 

^11*-7*11 <P< | II*-7*11- 

This chain of inequalities in invalid, because 2 is ordinary and 7 / From the 
contradiction follows B(z,p) C V°. 

3. Let p £ ]0,p*]. We show that C(z,p) C T>°. The cone C(z,p) is contained 
in due to part one of this proof and convexity of Suppose that there is 
a point x £ C(z, p) D dV M . Then there is a 7 £ Q \ {e} such that x lies on the 
hyperplane % separating the Dirichlet fundamental domains and 7 Consider 
the ray £+ = {Aa; : 0 < A} C C(z,p). Two cases can occur: (1) either £+ C H or 
(2) £+ nW = {x}. The first case contradicts that B(z,p) is in the interior of X> M , 
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because the ray Cf. passes through B(z,p). The second case contradicts convexity of 
"D ;J . Thus we proved C{z,p) C X>°. 

4. Let p £ ]0, p*] and X, Y £ C(Z,p). Then there are positive scalars a,b > 0 such 
that X' = aX and Y' = bY are contained in B(Z, p) by definition of C(Z, p). Suppose 
that x' £ X' and y' £ Y' are representations of X' and Y' such that x\y' £ B(z,p) C 
T>°. Since Q is a subgroup of the general linear group, we have x' = ax and y' = by 
with x £ X and y £ Y. From part three of this proof follows that x = x'/a and 
y = y'/b are also contained in X>°. Applying Prop. [375] yields 

k(X', Y') = (ax) T (by) = ab ( x T y ) = ab ■ k(X, Y) 


This implies 6 (X,Y) = \\x — y\\. This shows a bijective isometry from C(Z 1 p) onto 
y{C{Z lP )). 

5. We show that p(C(Z,p)) = C(z,p). From part four of this proof follows that 
p(C(Z,p)) C C(z,p). It remains to show that C(z,p) C p(C(Z,p)). Let x £ C(z,p). 
Then there is a scalar a > 0 such that x' = ax is in the ball B(z,p) C C(z 1 p). From 
the first part of the proof follows that there is a graph X' £ B(X\p) C C(Z,p) with 
x' £ X'. By definition of C(Z,p), we have A' = X'/a is also in C(Z,p). We need to 
show that p(X) = x. From the third part of this proof follows that C(z, p) C 2?°. This 
implies that x £ T>°. From Prop. 


3.13 


follows that there is no other representation 
M according to the Weak Graph 


x' £ X contained in V^. Since p is surjective onto V 
Representation Theorem, we have p{X) = x. This shows that C(z,p) C p(C(Z,p)). 


The maximum radius p* in Theorem |3.18| is half the minimum distance of 2 from 
the boundary of its Dirichlet fundamental domain "D M . The circular cone C(z, p*) in X 
is wider the more centered z is within its Dirichlet fundamental domain T)^. Then by 
isometry, the cone C(Z,p*) in Qu is also wider. Note that for every generic graph Z, 
the circular cone C(z,p*) never collapses to a single ray. Figure [2] visualizes Theorem 

Era 

A direct implication of the previous discussion is a correspondence of basic geomet¬ 
rical concepts between graph edit kernel spaces and their images in Euclidean spaces 
via alignment along an ordinary graph. The next result summarizes some correspon¬ 
dences. 

Corollary 3.19 Let p : Qu —> X be an alignment along an ordinary graph Z £ Qw 
Then the following statements hold for all X £ Q-j^: 

1. k(Z,X) = p(Z) T p(X) for all X £ Q n . 

2. l(X) = HmPOII 

3. <(Z,X) = <(p(Z),p(X)). 

f. Z and X are orthogonal p{Z) and p(X) are orthogonal. 

5. Z is orthogonal to a subset U C => p(Z) is orthogonal to p{U). 
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Figure 2: 



Illustration of Theorem 3.18 The two columns represent alignments [i along 
different graphs Z into the Euclidean space X. The restriction of /x to the 
isometry cone C(Y) is an isometric isomorphism into the (round) hypercone 
C(z), where z = /. i(Z ). is the image of graph Z. The isometric and isomor¬ 
phic cones are shaded in dark orange. The hypercone C(z) is wider the more 
z is centered within its Dirichlet fundamental domain D u . 
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4 Discussion 


This contribution studies the geometry of graph edit kernel spaces. Results presented 
in this paper serve as a basis for statistical data analysis on graphs. The main result is 
the Graph Representation Theorem. It states that under mild assumptions graphs are 
points of a geometrical space, called orbit space. Orbit spaces are well investigated and 
easier to explore than the original graph edit kernel space. Consequently, we derived 
a number of results from orbit spaces useful for statistical data analysis on graphs and 
translated them to graph edit kernel spaces. 

In the remainder of this section, we conclude with indicating the significance and 
usefulness of the results for statistical pattern recognition on graphs. 

Graph edit kernels and graph metrics induced by edit kernels are used in numerous 
applications. Consequently, there is ongoing research on devising graph matching 
algorithms for computing graph edit kernels and their induced metric [D 0 m HI ED 

M HQH3 HH Ell 131133] ■ 

The notion of angle, length, and orthogonality together with the Graph Represen¬ 
tation Theorem and its weak version are useful for a geometric interpretation of linear 
classifiers generalized to graph edit kernel spaces |20l 121] . 

One of the most fundamental statistic is the concept of mean of a random sample 
of graphs. The Weak Graph representation Theorem, Theorem 1 3.3 1 and Theorem |3.18| 
partly in conjunction with results from [5] are useful for addressing the following issues: 

1. Existence of a sample mean of graphs. 

2. Uniqueness of a sample mean of graphs. 

3. Strong consistency of sample mean of graphs. 

4. Midpoint property of a mean of two graphs. 

5. Vectorial characterization of sample mean of graphs. 
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